Control of exploitation–exploration meta-parameter in reinforcement learning

Related resources

Control of exploitation-exploration meta-parameter in reinforcement learning

In reinforcement learning (RL), the trade-off between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance between exploitation and exploration. Our learning scheme is based on model-based RL, in which Bayesian inference with a forgetting effect estimates the state-transition probability of the environment. The balance parameter,...
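The abstract is truncated here, so the paper's actual update rule is not shown. As a rough illustration of what estimating state-transition probabilities with a forgetting effect can look like, the sketch below keeps Dirichlet-style pseudo-counts per state-action pair and decays them before each new observation. The class name `ForgettingTransitionModel` and the parameters `forgetting` and `prior` are assumptions made for this example, not the authors' formulation.

```python
from collections import defaultdict


class ForgettingTransitionModel:
    """Illustrative count-based transition model with exponential forgetting.

    This is an assumed, generic form and not the paper's exact estimator.
    """

    def __init__(self, n_states, forgetting=0.99, prior=1.0):
        self.n_states = n_states
        self.forgetting = forgetting  # hypothetical decay factor in (0, 1]
        self.prior = prior            # hypothetical symmetric pseudo-count
        # One row of decayed counts per (state, action) pair.
        self.counts = defaultdict(lambda: [0.0] * n_states)

    def update(self, state, action, next_state):
        """Decay old evidence, then record the newly observed transition."""
        row = self.counts[(state, action)]
        for i in range(self.n_states):
            row[i] *= self.forgetting
        row[next_state] += 1.0

    def probs(self, state, action):
        """Posterior-mean estimate of P(next_state | state, action)."""
        row = self.counts[(state, action)]
        total = sum(row) + self.prior * self.n_states
        return [(c + self.prior) / total for c in row]
```

With `forgetting` below 1, recent transitions outweigh older ones, so the estimate can track an environment whose dynamics drift over time; with `forgetting=1.0` the model reduces to ordinary count-based estimation.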

Reinforcement Learning Based PID Control of Wind Energy Conversion Systems

In this paper, an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. The adaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinear characteristics of wind variations as plant input, wind turbine structure and generator operational behavior demand for high quality adaptive controller to ensure both robust stability an...

Exploring parameter space in reinforcement learning

This paper discusses parameter-based exploration methods for reinforcement learning. Parameter-based methods perturb parameters of a general function approximator directly, rather than adding noise to the resulting actions. Parameter-based exploration unifies reinforcement learning and black-box optimization, and has several advantages over action perturbation. We review two recent parameter-ex...

Reinforcement Learning in Control

During its melt cycle, an arc furnace causes disturbances of the electrical supply. Existing measurement techniques for this application lead to corrective rather than predictive compensation. The use of neural networks to control the compensation is being considered, in particular reinforcement learning strategies which require no pre-training and which can adapt to a dynamically changing envi...

Evolution of Meta-parameters in Reinforcement Learning

A crucial issue in reinforcement learning applications is how to set meta-parameters, such as the learning rate and "temperature" for exploration, to match the demands of the task and the environment. In this thesis, a method to adjust meta-parameters of reinforcement learning by using a real-number genetic algorithm is proposed. Simulations of foraging tasks show that appropriate settings of m...
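To make the "temperature" meta-parameter concrete, the following is a minimal sketch of softmax (Boltzmann) action selection, a common way temperature enters exploration in RL. The abstract does not say which selection rule the thesis uses, so the function names `softmax_policy` and `sample_action` and the list of action values `q_values` are assumptions for illustration only.

```python
import math
import random


def softmax_policy(q_values, temperature):
    """Return Boltzmann action probabilities for the given action values."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Subtract the maximum before exponentiating for numerical stability.
    best = max(q_values)
    weights = [math.exp((q - best) / temperature) for q in q_values]
    total = sum(weights)
    return [w / total for w in weights]


def sample_action(q_values, temperature, rng=random):
    """Draw one action index according to the softmax probabilities."""
    probs = softmax_policy(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]
```

A low temperature concentrates probability on the highest-valued action (exploitation), while a high temperature pushes the distribution toward uniform (exploration), which is why the setting has to match the task.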

Journal

Journal title: Neural Networks

Year: 2002

ISSN: 0893-6080

DOI: 10.1016/s0893-6080(02)00056-4